ADVANCES ON THE BESSIS-MOUSSA-VILLANI TRACE CONJECTURE



CHRISTOPHER J. HILLAR 

Abstract. A long-standing conjecture asserts that the polynomial

\[ p(t) = \mathrm{Tr}[(A + tB)^m] \]

has nonnegative coefficients whenever $m$ is a positive integer and $A$ and $B$ are
any two $n \times n$ positive semidefinite Hermitian matrices. The conjecture arises
from a question raised by Bessis, Moussa, and Villani (1975) in connection
with a problem in theoretical physics. Their conjecture, as shown recently
by Lieb and Seiringer, is equivalent to the trace positivity statement above.
In this paper, we derive a fundamental set of equations satisfied by $A$ and
$B$ that minimize or maximize a coefficient of $p(t)$. Applied to the
Bessis-Moussa-Villani (BMV) conjecture, these equations provide several reductions.
In particular, we prove that it is enough to show that (1) it is true for infinitely
many $m$, (2) a nonzero (matrix) coefficient of $(A + tB)^m$ always has at least
one positive eigenvalue, or (3) the result holds for singular positive semidefinite
matrices. Moreover, we prove that if the conjecture is false for some $m$, then
it is false for all larger $m$.



1. Introduction 

In [1], while studying partition functions of quantum mechanical systems, a
conjecture was made regarding a positivity property of traces of matrices. If this
property holds, explicit error bounds in a sequence of Padé approximants follow.
Let $A$ and $B$ be $n \times n$ Hermitian matrices with $B$ positive semidefinite, and let

\[ \phi^{A,B}(t) = \mathrm{Tr}[\exp(A - tB)]. \]

The original formulation of the Bessis-Moussa-Villani conjecture asserts that the
function $\phi^{A,B}$ is completely monotone; in other words, $\phi^{A,B}$ is the Laplace
transform of a positive measure $\mu^{A,B}$ supported on $[0, \infty)$:

\[ \mathrm{Tr}[\exp(A - tB)] = \int_0^\infty \exp(-tx)\, d\mu^{A,B}(x). \]

Equivalently, the derivatives of the function $f(t) = \phi^{A,B}(t)$ alternate signs:

\[ (-1)^n f^{(n)}(t) \ge 0, \quad t > 0, \quad n = 0, 1, 2, \ldots. \]

Since its introduction in [1], many partial results and substantial computational
experimentation have been given [31 IH [3 [H [71 [HI [131 [Il]j all in favor of the 
conjecture's validity. However, despite much work, very little is known about the 
problem, and it has remained unresolved except in very special cases. Recently, Lieb
and Seiringer in [11], and as previously communicated to us [7], have reformulated
the conjecture of [1] as a question about the traces of certain sums of words in two
positive definite matrices. In what follows, we shall use the standard convention
that a positive definite (resp. positive semidefinite) matrix is one that is complex
Hermitian and has positive eigenvalues (resp. nonnegative eigenvalues).

Conjecture 1.1 (Bessis-Moussa-Villani). The polynomial $p(t) = \mathrm{Tr}[(A + tB)^m]$
has all nonnegative coefficients whenever $A$ and $B$ are $n \times n$ positive semidefinite
matrices.

Remark 1.2. Although not immediately obvious, the polynomial $p(t)$ has all real
coefficients (see Corollary 2.4).

The coefficient of $t^k$ in $p(t)$ is the trace of $S_{m,k}(A, B)$, the sum of all words of
length $m$ in $A$ and $B$ in which $k$ $B$'s appear (it has been called the $k$-th Hurwitz
product of $A$ and $B$). In [7], among other things, it was noted that, for $m < 6$, each
constituent word in $S_{m,k}(A, B)$ has nonnegative trace. Thus, the above conjecture
is valid for $m < 6$ and arbitrary positive integers $n$. It was also noted in [7] (see
also [1]) that the conjecture is valid for arbitrary $m$ and $n < 3$. Thus, the first case
in which prior methods did not apply and the conjecture was in doubt is $m = 6$
and $n = 3$. Even in this case, all coefficients, except $\mathrm{Tr}[S_{6,3}(A, B)]$, were known
to be nonnegative (also as shown in [7]). It was only recently [8], using heavy
computation, that this remaining coefficient was shown to be nonnegative.

Much of the subtlety of Conjecture 1.1 can be seen by the fact that $S_{m,k}(A, B)$
need not have all nonnegative eigenvalues, and in addition that some words within
the $S_{m,k}(A, B)$ expression can have negative trace (see [7], where it is shown that
$\mathrm{Tr}[ABABBA]$ can be negative).

Our advancement is the introduction of a fundamental pair of matrix equations
satisfied by $A$ and $B$ that minimize or maximize a coefficient of $p(t)$. In what
follows, we will be using the natural Euclidean norm on the set of complex $n \times n$
matrices:

\[ \|A\| = \mathrm{Tr}[AA^*]^{1/2}. \]

(Here, $C^*$ denotes the conjugate transpose of a complex matrix $C$.) The precise
statement of our main result is the following.

Theorem 1.3. Let $m > k > 0$ be positive integers, and let $A$ and $B$ be positive
semidefinite matrices of norm 1 that minimize (resp. maximize) the quantity
$\mathrm{Tr}[S_{m,k}(A, B)]$ over all positive semidefinite matrices of norm 1. Then, $A$ and $B$
satisfy the following pair of equations:

(1.1)   $A S_{m-1,k}(A, B) = A^2\, \mathrm{Tr}[A S_{m-1,k}(A, B)]$
        $B S_{m-1,k-1}(A, B) = B^2\, \mathrm{Tr}[B S_{m-1,k-1}(A, B)]$.
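As a small sanity check (an illustration, not part of the argument), the following sketch verifies the two equations of Theorem 1.3 for the commuting pair $A = B = \mathrm{diag}(1, 0)$, which has norm 1. The helper names `hurwitz` and `satisfies_equations` are hypothetical; `hurwitz` builds $S_{m,k}$ from the first-letter recurrence, with the assumed base cases $S_{0,0} = I$ and $S_{m,k} = 0$ when $k$ lies outside $[0, m]$.

```python
def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][l] * Y[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def matadd(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def scale(c, X):
    return [[c * x for x in row] for row in X]

def trace(X):
    return sum(X[i][i] for i in range(len(X)))

def hurwitz(m, k, A, B):
    """S_{m,k}(A, B): sum of all words of length m with exactly k letters B."""
    n = len(A)
    if k < 0 or k > m:
        return [[0] * n for _ in range(n)]
    if m == 0:
        return [[int(i == j) for j in range(n)] for i in range(n)]
    return matadd(matmul(A, hurwitz(m - 1, k, A, B)),
                  matmul(B, hurwitz(m - 1, k - 1, A, B)))

def satisfies_equations(m, k, A, B):
    """Check A S_{m-1,k} = A^2 Tr[A S_{m-1,k}] and the analogous B equation."""
    AS = matmul(A, hurwitz(m - 1, k, A, B))
    BS = matmul(B, hurwitz(m - 1, k - 1, A, B))
    first = AS == scale(trace(AS), matmul(A, A))
    second = BS == scale(trace(BS), matmul(B, B))
    return first and second

E = [[1, 0], [0, 0]]  # positive semidefinite, norm 1, rank one
print(satisfies_equations(4, 2, E, E))
```

Integer entries keep the comparison exact; for general norm-1 matrices a floating-point tolerance would be needed.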

We call (1.1) the Euler-Lagrange equations for Conjecture 1.1. The name comes
from the resemblance of our techniques to those of computing the first variation in
the calculus of variations. We should remark that there have been other variational
approaches to this problem [5], [3]; a review can be found in [14]. Although we are
motivated by Conjecture 1.1, we discovered that these equations are also satisfied
by a minimization (resp. maximization) over Hermitian matrices $A$ and $B$ of norm
1 (see Corollary 3.7), and it is natural to consider this more general situation. In
this regard, we present the following application of the Euler-Lagrange equations.

Theorem 1.4. If $A$ and $B$ are Hermitian matrices of norm 1 and $m > 1$, then

\[ |\mathrm{Tr}[S_{m,k}(A, B)]| \le \binom{m}{k}. \]

Moreover, if $m > k > 0$, then equality holds only when $A = \pm B$, and if in addition
$m > 2$, then $A$ has precisely one nonzero eigenvalue.

Remark 1.5. When $m = 1$, this theorem fails to hold. For example, let $A$ be the $n \times n$
diagonal matrix $A = \mathrm{diag}(n^{-1/2}, \ldots, n^{-1/2})$. Then $\|A\| = 1$, but
$\mathrm{Tr}[S_{1,0}(A, B)] = \mathrm{Tr}[A] = n^{1/2} > 1$ for $n > 1$.

It is easy to see that this maximum is at least $\binom{m}{k}$, and using elementary
considerations involving the Cauchy-Schwarz inequality, one can show that

\[ |\mathrm{Tr}[S_{m,k}(A, B)]| \le \|S_{m,k}(A, B)\|\, n^{1/2} \le \binom{m}{k}\, n^{1/2}. \]

However, we do not know if a dependency on the size of the matrices involved can
be removed without appealing to equations (1.1).
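The bound of Theorem 1.4 can be illustrated numerically. The sketch below is only an illustration under stated assumptions: the matrices `P` and `Q` are hypothetical rank-one Hermitian examples of norm 1, and `hurwitz` is a hypothetical helper that builds $S_{m,k}$ by the first-letter recurrence. With $A = B = P$ the trace equals $\binom{m}{k}$, while for the pair $(P, Q)$ it stays within the bound.

```python
from math import comb

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][l] * Y[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def matadd(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def trace(X):
    return sum(X[i][i] for i in range(len(X)))

def hurwitz(m, k, A, B):
    """S_{m,k}(A, B), with S_{0,0} = I and S_{m,k} = 0 for k outside [0, m]."""
    n = len(A)
    if k < 0 or k > m:
        return [[0] * n for _ in range(n)]
    if m == 0:
        return [[int(i == j) for j in range(n)] for i in range(n)]
    return matadd(matmul(A, hurwitz(m - 1, k, A, B)),
                  matmul(B, hurwitz(m - 1, k - 1, A, B)))

P = [[1, 0], [0, 0]]          # Hermitian, Frobenius norm 1
Q = [[0.5, 0.5], [0.5, 0.5]]  # Hermitian, Frobenius norm 1

def traces(m, k):
    """Return (Tr S_{m,k}(P, P), Tr S_{m,k}(P, Q))."""
    return trace(hurwitz(m, k, P, P)), trace(hurwitz(m, k, P, Q))

for m in range(2, 6):
    for k in range(m + 1):
        equal_case, mixed_case = traces(m, k)
        print(m, k, comb(m, k), equal_case, mixed_case)
```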

As a strategy to prove Conjecture 1.1, we offer the following.

Conjecture 1.6. Let $m > k > 0$ be positive integers. Positive semidefinite (resp.
Hermitian) matrices $A$ and $B$ of norm 1 that satisfy the Euler-Lagrange equations
commute.

From this result, Conjecture 1.1 would be immediate. Of course, Theorem 1.4
implies that Conjecture 1.6 holds for the case of Hermitian matrices. We next list
some of the major consequences of the equations found in Theorem 1.3. The first
one implies that counterexamples to Conjecture 1.1 are closed upwards. The precise
statement is given by the following.

Theorem 1.7. Suppose that there exist integers $M, K$ and $n \times n$ positive definite
matrices $A$ and $B$ such that $\mathrm{Tr}[S_{M,K}(A, B)] < 0$. Then, for any $m \ge M$ and
$k \ge K$ such that $m - k \ge M - K$, there exist $n \times n$ positive definite $A$ and $B$
making $\mathrm{Tr}[S_{m,k}(A, B)]$ negative.

Corollary 1.8. If the Bessis-Moussa-Villani conjecture is true for some $m_0$, then
it is also true for all $m < m_0$.

This reduces the BMV conjecture to its "asymptotic" formulation. 

Corollary 1.9. If the Bessis-Moussa-Villani conjecture is true for infinitely many 
m, then it is true for all m. 

A next result characterizes the BMV conjecture in terms of the eigenvalues of
the matrix $S_{m,k}(A, B)$.

Theorem 1.10. Fix positive integers $m > k$ and $n$. Then, $\mathrm{Tr}[S_{m,k}(A, B)] \ge 0$ for
all $n \times n$ positive semidefinite $A$ and $B$ if and only if whenever $S_{m,k}(A, B) \ne 0$, it
has at least one positive eigenvalue.

Remark 1.11. This theorem can be viewed as a transfer principle for the BMV 
conjecture: instead of proving positivity for the sum of all the eigenvalues, we need 
only show it for at least one of them. 

Our final result generalizes a fact first discovered in [8] (there only the real case
was considered), and it implies that it is enough to prove the Bessis-Moussa-Villani
conjecture for singular $A$ and $B$.

Theorem 1.12. Let $m, n$ be positive integers, and suppose that $\mathrm{Tr}[(A + tB)^{m-1}]$
has nonnegative coefficients for each pair of $n \times n$ positive semidefinite matrices
$A$ and $B$. If $p(t) = \mathrm{Tr}[(A + tB)^m]$ has nonnegative coefficients whenever $A, B$ are
singular $n \times n$ positive semidefinite matrices, then $p(t)$ has nonnegative coefficients
whenever $A$ and $B$ are arbitrary $n \times n$ positive semidefinite matrices.

The organization of this paper is as follows. In Section 2 we recall some facts
about Hurwitz products, and in Section 3 we derive the two equations found in
Theorem 1.3. Finally, in Section 4 we use these equations to prove our main
Theorems 1.4, 1.7, 1.10, and 1.12.

2. Preliminaries 

We begin with a review of some basic facts involving Hurwitz products; some of
this material can be found in [8]. The coefficients $S_{m,k}(A, B)$ may be generated via
the recurrence:

(2.1)   $S_{m,k}(A, B) = A S_{m-1,k}(A, B) + B S_{m-1,k-1}(A, B)$.
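The recurrence (2.1) can be sketched directly in code and compared with the definition of $S_{m,k}$ as a sum over words. This is a minimal illustration: the function names are hypothetical, and the base cases $S_{0,0} = I$ and $S_{m,k} = 0$ for $k < 0$ or $k > m$ are assumptions made so that the recursion terminates. Integer matrices are used so that both computations agree exactly.

```python
from itertools import product

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][l] * Y[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def matadd(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def identity(n):
    return [[int(i == j) for j in range(n)] for i in range(n)]

def hurwitz(m, k, A, B):
    """S_{m,k}(A, B) via recurrence (2.1)."""
    n = len(A)
    if k < 0 or k > m:
        return [[0] * n for _ in range(n)]
    if m == 0:
        return identity(n)
    return matadd(matmul(A, hurwitz(m - 1, k, A, B)),
                  matmul(B, hurwitz(m - 1, k - 1, A, B)))

def hurwitz_by_words(m, k, A, B):
    """S_{m,k}(A, B) by summing every word of length m with exactly k B's."""
    n = len(A)
    total = [[0] * n for _ in range(n)]
    for word in product("AB", repeat=m):
        if word.count("B") != k:
            continue
        W = identity(n)
        for letter in word:
            W = matmul(W, A if letter == "A" else B)
        total = matadd(total, W)
    return total

A = [[2, 1], [1, 1]]
B = [[1, 0], [0, 3]]
print(hurwitz(4, 2, A, B) == hurwitz_by_words(4, 2, A, B))
```

The recursive version recomputes subproblems; memoizing on `(m, k)` would be the natural refinement for larger `m`.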

The following lemmas will be useful for computing the traces of the $S_{m,k}$.

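Lemma 2.1. Fix integers $m > k > 0$. For any two $n \times n$ matrices $A$ and $B$, we
have

\[ \mathrm{Tr}[S_{m,k}(A, B)] = \frac{m}{m-k}\, \mathrm{Tr}[A S_{m-1,k}(A, B)]. \]

Proof. Consider the following chain of equalities:

\begin{align*}
\mathrm{Tr}\left[ mA (A + tB)^{m-1} \right]
  &= \mathrm{Tr}\left[ \sum_{i=1}^{m} (A + tB)^{i-1} A (A + tB)^{m-i} \right] \\
  &= \mathrm{Tr}\left[ \frac{d}{dy} (Ay + tB)^m \Big|_{y=1} \right] \\
  &= \frac{d}{dy}\, \mathrm{Tr}\left[ (Ay + tB)^m \right] \Big|_{y=1}.
\end{align*}

The coefficient of $t^k$ in the first expression is $m\, \mathrm{Tr}[A S_{m-1,k}(A, B)]$.
Since $S_{m,k}(Ay, B) = y^{m-k} S_{m,k}(A, B)$, it follows that the coefficient of $t^k$ in the
last expression above is $(m - k)\, \mathrm{Tr}[S_{m,k}(A, B)]$. Hence,

\[ m\, \mathrm{Tr}[A S_{m-1,k}(A, B)] = (m - k)\, \mathrm{Tr}[S_{m,k}(A, B)]. \]

This proves the lemma. □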



Lemma 2.2. Fix integers $m > k > 0$. For any two $n \times n$ matrices $A$ and $B$, we
have

\[ \mathrm{Tr}[S_{m,k}(A, B)] = \frac{m}{k}\, \mathrm{Tr}[B S_{m-1,k-1}(A, B)]. \]

Proof. Follows from Lemma 2.1 by taking the trace of both sides of equation (2.1). □
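The trace identities of Lemmas 2.1 and 2.2 hold for arbitrary square matrices, so they can be spot-checked with exact integer arithmetic. The sketch below is illustrative only; `hurwitz` and `check_trace_identities` are hypothetical names, and the test matrices are arbitrary (not Hermitian) choices. Both identities are cleared of denominators before comparison.

```python
def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][l] * Y[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def matadd(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def trace(X):
    return sum(X[i][i] for i in range(len(X)))

def hurwitz(m, k, A, B):
    """S_{m,k}(A, B), with S_{0,0} = I and S_{m,k} = 0 for k outside [0, m]."""
    n = len(A)
    if k < 0 or k > m:
        return [[0] * n for _ in range(n)]
    if m == 0:
        return [[int(i == j) for j in range(n)] for i in range(n)]
    return matadd(matmul(A, hurwitz(m - 1, k, A, B)),
                  matmul(B, hurwitz(m - 1, k - 1, A, B)))

def check_trace_identities(m, k, A, B):
    """For m > k > 0, test both lemmas in denominator-free form."""
    total = trace(hurwitz(m, k, A, B))
    # Lemma 2.1: (m - k) Tr[S_{m,k}] = m Tr[A S_{m-1,k}]
    lemma_2_1 = (m - k) * total == m * trace(matmul(A, hurwitz(m - 1, k, A, B)))
    # Lemma 2.2: k Tr[S_{m,k}] = m Tr[B S_{m-1,k-1}]
    lemma_2_2 = k * total == m * trace(matmul(B, hurwitz(m - 1, k - 1, A, B)))
    return lemma_2_1 and lemma_2_2

A = [[1, 2], [0, 1]]
B = [[0, 1], [3, 1]]
print(all(check_trace_identities(m, k, A, B)
          for m in range(2, 7) for k in range(1, m)))
```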

Let $A$ and $B$ be $n \times n$ Hermitian matrices. Since $S_{m,k}(A, B)$ is the sum of all
words of length $m$ in $A$ and $B$ with $k$ $B$'s, it follows that the conjugate transpose of
$S_{m,k}(A, B)$ simply permutes its constituent summands. This verifies the following
fact.

Lemma 2.3. If $A$ and $B$ are $n \times n$ Hermitian matrices, then the matrix $S_{m,k}(A, B)$
is Hermitian.

Corollary 2.4. The polynomial $p(t) = \mathrm{Tr}[(A + tB)^m]$ has all real coefficients
whenever $A$ and $B$ are $n \times n$ Hermitian matrices.

Although $S_{m,k}(A, B)$ is Hermitian for Hermitian $A$ and $B$, it need not be positive
definite even when $A$ and $B$ are $n \times n$ positive definite matrices, $n \ge 2$. Examples
are easily generated, and computational experiments suggest that it is usually not
positive definite. Finally, we record a useful fact about positive definite congruence.

Lemma 2.5. Let C be any complex nxn matrix and let A be an n x n positive 
semidefinite matrix. Then CAC* is positive semidefinite. 

Proof. See p. 399]. □ 
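Returning to the remark before Lemma 2.5 that $S_{m,k}(A, B)$ need not be positive definite, one concrete instance is easy to exhibit. The matrices below are a hypothetical example chosen for illustration (they are positive definite but not normalized): the Hurwitz product $S_{2,1}(A, B) = AB + BA$ is symmetric, as Lemma 2.3 predicts, yet has negative determinant and hence one negative eigenvalue, while its trace is still positive.

```python
def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][l] * Y[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def matadd(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def det2(X):
    return X[0][0] * X[1][1] - X[0][1] * X[1][0]

def is_positive_definite_2x2(X):
    """Sylvester's criterion for a real symmetric 2 x 2 matrix."""
    return X[0][1] == X[1][0] and X[0][0] > 0 and det2(X) > 0

A = [[100, 0], [0, 1]]
B = [[100, 100], [100, 101]]

S21 = matadd(matmul(A, B), matmul(B, A))  # S_{2,1}(A, B) = AB + BA

print(is_positive_definite_2x2(A), is_positive_definite_2x2(B))
print(S21, det2(S21), S21[0][0] + S21[1][1])
```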

3. Derivation of the Euler-Lagrange Equations 

The arguments for our main theorems are based on a variational observation. It
says that an expression $\mathrm{Tr}[S_{m,k}(A, B)]$ is minimized or maximized when $A$ and $B$
satisfy the Euler-Lagrange equations (see Corollary 3.6). Before presenting a proof
of this fact, we give a series of technical preliminaries.

Proposition 3.1. Let $m > k > 0$ be positive integers. Fix $B$ to be any Hermitian
$n \times n$ matrix, and suppose that $A$ is a positive semidefinite matrix of norm 1 that
minimizes (resp. maximizes)

\[ \mathrm{Tr}[S_{m,k}(A, B)] \]

over all positive semidefinite matrices of norm 1. Let $\varepsilon > 0$, and let $C := C(x) =
(c_{rs}(x))$ be an $n \times n$ matrix with entries $c_{rs}(x) = u_{rs}(x) + i v_{rs}(x)$ in which $u_{rs}$
and $v_{rs}$ are differentiable functions $u_{rs}, v_{rs} : [-\varepsilon, \varepsilon] \to \mathbb{R}$. Moreover, suppose that
$C(0) = I$ and $CAC^* \ne 0$ for all $x \in [-\varepsilon, \varepsilon]$. Then, the following identity holds:

\[ \mathrm{Tr}\left[ \frac{d}{dx}\left( \frac{CAC^*}{\|CAC^*\|} \right)\Big|_{x=0} S_{m-1,k}(A, B) \right] = 0. \]



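Proof. Let $A$, $B$, and $C$ be as in the statement of the proposition. Keeping in mind
Corollary 2.4, we may consider the differentiable function $f : [-\varepsilon, \varepsilon] \to \mathbb{R}$ given by

\[ f(x) = \mathrm{Tr}\left[ S_{m,k}\left( \frac{CAC^*}{\|CAC^*\|}, B \right) \right]. \]

By hypothesis, the minimum (resp. maximum) of $f$ is achieved at $x = 0$. Consequently,
it follows that

(3.1)   \[ \frac{df}{dx}\Big|_{x=0} = 0. \]

Next, notice that

\[ \frac{d}{dx}\, \mathrm{Tr}\left[ \left( \frac{CAC^*}{\|CAC^*\|} + tB \right)^m \right]
   = \mathrm{Tr}\left[ \sum_{i=1}^{m} \left( \frac{CAC^*}{\|CAC^*\|} + tB \right)^{i-1}
     \frac{d}{dx}\left( \frac{CAC^*}{\|CAC^*\|} \right)
     \left( \frac{CAC^*}{\|CAC^*\|} + tB \right)^{m-i} \right]. \]

When $x = 0$, the above expression evaluates to

\[ \mathrm{Tr}\left[ \sum_{i=1}^{m} (A + tB)^{i-1}
     \frac{d}{dx}\left( \frac{CAC^*}{\|CAC^*\|} \right)\Big|_{x=0} (A + tB)^{m-i} \right]
   = m\, \mathrm{Tr}\left[ \frac{d}{dx}\left( \frac{CAC^*}{\|CAC^*\|} \right)\Big|_{x=0} (A + tB)^{m-1} \right]. \]

Since $f'(0)$ is the coefficient of $t^k$ in this expression, it follows, therefore, from (3.1) that

\[ \mathrm{Tr}\left[ \frac{d}{dx}\left( \frac{CAC^*}{\|CAC^*\|} \right)\Big|_{x=0} S_{m-1,k}(A, B) \right] = 0. \]

This completes the proof. □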



A corresponding statement can be made by fixing A and minimizing (resp. max- 
imizing) over B. 

Proposition 3.2. Let $m > k > 0$ be positive integers. Fix $A$ to be any Hermitian
$n \times n$ matrix, and let $B$ be a positive semidefinite matrix of norm 1 that minimizes
(resp. maximizes)

\[ \mathrm{Tr}[S_{m,k}(A, B)] \]

over all positive semidefinite matrices of norm 1. Let $\varepsilon > 0$, and let $C := C(x) =
(c_{rs}(x))$ be an $n \times n$ matrix with entries $c_{rs}(x) = u_{rs}(x) + i v_{rs}(x)$ in which $u_{rs}$
and $v_{rs}$ are differentiable functions $u_{rs}, v_{rs} : [-\varepsilon, \varepsilon] \to \mathbb{R}$. Moreover, suppose that
$C(0) = I$ and $CBC^* \ne 0$ for all $x \in [-\varepsilon, \varepsilon]$. Then, the following identity holds:

\[ \mathrm{Tr}\left[ \frac{d}{dx}\left( \frac{CBC^*}{\|CBC^*\|} \right)\Big|_{x=0} S_{m-1,k-1}(A, B) \right] = 0. \]

Proof. The proof is similar to that of Proposition 3.1, so we omit it. □

In our next lemma, we compute the derivative found in Propositions 3.1 and 3.2.
For notational simplicity, the entry-wise derivative of the matrix $C$ evaluated at
the point $x = 0$ will be denoted by $C'$.

Lemma 3.3. With the hypotheses as in Proposition 3.1, we have

\[ \frac{d}{dx}\left( \frac{CAC^*}{\|CAC^*\|} \right)\Big|_{x=0}
   = C'A + AC'^* - \mathrm{Tr}[C'A^2]A - \mathrm{Tr}[C'^*A^2]A. \]



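Proof. A straightforward application of the product rule [10] for (matrix)
differentiation gives us that

(3.2)   \[ \frac{d}{dx}\left( \frac{CAC^*}{\|CAC^*\|} \right)
          = \left( \frac{d}{dx} \|CAC^*\|^{-1} \right) CAC^*
          + \frac{1}{\|CAC^*\|} \left( \frac{dC}{dx} AC^* + CA \frac{dC^*}{dx} \right). \]

Next, we compute that

\begin{align*}
\frac{d}{dx} \|CAC^*\|^{-1}
  &= -\|CAC^*\|^{-2}\, \frac{d}{dx} \|CAC^*\| \\
  &= -\|CAC^*\|^{-2}\, \frac{d}{dx} \left( \mathrm{Tr}[CAC^*CAC^*]^{1/2} \right) \\
  &= -(1/2)\, \|CAC^*\|^{-2}\, \mathrm{Tr}[CAC^*CAC^*]^{-1/2}\, \frac{d}{dx} \mathrm{Tr}[CAC^*CAC^*].
\end{align*}

The product expansion of $\frac{d}{dx} \mathrm{Tr}[CAC^*CAC^*]$ occurring in this last line is:

\begin{align*}
&\mathrm{Tr}\left[ \frac{dC}{dx} AC^*CAC^* + CA \frac{dC^*}{dx} CAC^*
   + CAC^* \frac{dC}{dx} AC^* + CAC^*CA \frac{dC^*}{dx} \right] \\
&\qquad = 2\, \mathrm{Tr}\left[ \frac{dC}{dx} AC^*CAC^* \right]
        + 2\, \mathrm{Tr}\left[ \frac{dC^*}{dx} CAC^*CA \right].
\end{align*}

Finally, setting $x = 0$ and using the assumptions that $\|A\| = 1$ and $C(0) = I$, it
follows that

\[ \frac{d}{dx}\left( \frac{CAC^*}{\|CAC^*\|} \right)\Big|_{x=0}
   = C'A + AC'^* - \mathrm{Tr}[C'A^2]A - \mathrm{Tr}[C'^*A^2]A, \]

completing the proof of the lemma. □

We now have enough to prove the main results of this section.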
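The derivative formula of Lemma 3.3 can be compared against a central finite difference. The sketch below is a numerical illustration under stated assumptions: it works in the real symmetric case (so $C^*$ is the transpose), uses the hypothetical choice $C(x) = I + xE_{12}$, and takes a hypothetical positive definite matrix normalized to norm 1. All function names are illustrative.

```python
import math

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][l] * Y[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def combine(a, X, b, Y):
    """Entrywise a*X + b*Y."""
    return [[a * x + b * y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def transpose(X):
    return [list(col) for col in zip(*X)]

def trace(X):
    return sum(X[i][i] for i in range(len(X)))

def frobenius(X):
    return math.sqrt(sum(x * x for row in X for x in row))

A0 = [[2.0, 1.0], [1.0, 1.0]]                           # positive definite
A = [[x / frobenius(A0) for x in row] for row in A0]   # now ||A|| = 1
C_prime = [[0.0, 1.0], [0.0, 0.0]]                     # C'(0) for C = I + x E_12

def normalized_congruence(x):
    """CAC^T / ||CAC^T|| for C = I + x E_12."""
    C = [[1.0, x], [0.0, 1.0]]
    G = matmul(matmul(C, A), transpose(C))
    return [[g / frobenius(G) for g in row] for row in G]

def lemma_formula():
    """C'A + AC'^T - (Tr[C'A^2] + Tr[C'^T A^2]) A."""
    A2 = matmul(A, A)
    t = trace(matmul(C_prime, A2)) + trace(matmul(transpose(C_prime), A2))
    first = combine(1.0, matmul(C_prime, A), 1.0, matmul(A, transpose(C_prime)))
    return combine(1.0, first, -t, A)

def central_difference(h=1e-6):
    return combine(1 / (2 * h), normalized_congruence(h),
                   -1 / (2 * h), normalized_congruence(-h))

exact, approx = lemma_formula(), central_difference()
max_gap = max(abs(e - a) for re, ra in zip(exact, approx) for e, a in zip(re, ra))
print(max_gap)
```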



Theorem 3.4. Let $m > k > 0$ be positive integers. Fix $B$ to be any Hermitian
$n \times n$ matrix, and let $A$ be a positive semidefinite matrix of norm 1 that minimizes
(resp. maximizes)

\[ \mathrm{Tr}[S_{m,k}(A, B)] \]

over all positive semidefinite matrices of norm 1. Then, the following identity holds:

\[ A S_{m-1,k}(A, B) = A^2\, \mathrm{Tr}[A S_{m-1,k}(A, B)]. \]

Proof. Let $A$ and $B$ be as in the hypotheses of the theorem. By using different
matrices $C$ in the statement of Proposition 3.1, we will produce a set of equations
satisfied by the entries of $A S_{m-1,k}(A, B)$ that combine to make the single matrix
equation above. For ease of presentation, we introduce the following notation. For
integers $r, s$, let $E_{rs}$ denote the $n \times n$ matrix with all zero entries except for a 1 in
the $(r, s)$ entry. We also write $\overline{N}$ for the entry-wise complex conjugate of a matrix $N$.

Fix integers $1 \le r, s \le n$ and take $C = I + xE_{rs}$. Since $C$ is invertible for all
$x \in [-1/2, 1/2]$, it follows that $CAC^* \ne 0$ for all such $x$. Therefore, the hypotheses
of Lemma 3.3 are satisfied. The formula there gives us that

(3.3)   \[ \frac{d}{dx}\left( \frac{CAC^*}{\|CAC^*\|} \right)\Big|_{x=0}
          = C'A + AC'^* - \mathrm{Tr}[C'A^2]A - \mathrm{Tr}[C'^*A^2]A. \]

Additionally, Proposition 3.1, along with a trace manipulation, tells us that

(3.4)   \begin{align*}
        \left( \mathrm{Tr}[C'A^2] + \mathrm{Tr}[C'^*A^2] \right) \mathrm{Tr}[A S_{m-1,k}(A, B)]
          &= \mathrm{Tr}\left[ C'A S_{m-1,k}(A, B) + AC'^* S_{m-1,k}(A, B) \right] \\
          &= \mathrm{Tr}\left[ C'A S_{m-1,k}(A, B) \right] + \mathrm{Tr}\left[ S_{m-1,k}(A, B) A C'^* \right] \\
          &= \mathrm{Tr}\left[ C'A S_{m-1,k}(A, B) \right] + \mathrm{Tr}\left[ C'\, \overline{A S_{m-1,k}(A, B)} \right].
        \end{align*}

Since $C' = E_{rs}$, a computation shows that for any matrix $N$, the trace of $C'N$ is
just the $(s, r)$ entry of $N$. In particular, it follows from (3.4) that the $(s, r)$ entries
of $A S_{m-1,k}(A, B) + \overline{A S_{m-1,k}(A, B)}$ and $(A^2 + \overline{A^2})\, \mathrm{Tr}[A S_{m-1,k}(A, B)]$ coincide. We
have therefore proved the following identity of matrices:

(3.5)   \[ A S_{m-1,k}(A, B) + \overline{A S_{m-1,k}(A, B)} = (A^2 + \overline{A^2})\, \mathrm{Tr}[A S_{m-1,k}(A, B)]. \]

We next perform a similar examination using the matrices $C = I + ixE_{rs}$ to
arrive at a second matrix identity. Combining equation (3.3) and Proposition 3.1
as before, we find that

(3.6)   \[ A S_{m-1,k}(A, B) - \overline{A S_{m-1,k}(A, B)} = (A^2 - \overline{A^2})\, \mathrm{Tr}[A S_{m-1,k}(A, B)]. \]

The theorem now follows by adding these two equations and dividing both sides of
the result by 2. □

Similar arguments using Proposition 3.2 in place of Proposition 3.1 produce the
following results.

Theorem 3.5. Let $m > k > 0$ be positive integers. Fix $A$ to be any Hermitian
$n \times n$ matrix, and let $B$ be a positive semidefinite matrix of norm 1 that minimizes
(resp. maximizes)

\[ \mathrm{Tr}[S_{m,k}(A, B)] \]

over all positive semidefinite matrices of norm 1. Then, the following identity holds:

\[ B S_{m-1,k-1}(A, B) = B^2\, \mathrm{Tr}[B S_{m-1,k-1}(A, B)]. \]

Combining the statements of this section, we have finally derived the
Euler-Lagrange equations (1.1) for Conjecture 1.1.

Corollary 3.6. Let $m > k > 0$ be positive integers, and let $A$ and $B$ be positive
semidefinite matrices of norm 1 that minimize (resp. maximize) the quantity
$\mathrm{Tr}[S_{m,k}(A, B)]$ over all positive semidefinite matrices of norm 1. Then $A$ and $B$
satisfy the following pair of equations:

        $A S_{m-1,k}(A, B) = A^2\, \mathrm{Tr}[A S_{m-1,k}(A, B)]$
        $B S_{m-1,k-1}(A, B) = B^2\, \mathrm{Tr}[B S_{m-1,k-1}(A, B)]$.

We remark that our proof generalizes directly to show that the same equations
hold for Hermitian minimizers (resp. maximizers), or more generally, for classes of
unit norm matrices with the same inertia. This result is the main ingredient in our
proof of Theorem 1.4 concerning the maximum of $\mathrm{Tr}[S_{m,k}(A, B)]$.

Corollary 3.7. Let $m > k > 0$ be positive integers, and let $A$ and $B$ be Hermitian
matrices of norm 1 that minimize (resp. maximize) the quantity $\mathrm{Tr}[S_{m,k}(A, B)]$
over all Hermitian matrices of norm 1. Then $A$ and $B$ must satisfy the following
pair of equations:

        $A S_{m-1,k}(A, B) = A^2\, \mathrm{Tr}[A S_{m-1,k}(A, B)]$
        $B S_{m-1,k-1}(A, B) = B^2\, \mathrm{Tr}[B S_{m-1,k-1}(A, B)]$.

In general, we conjecture that trace minimizers commute (Conjecture 1.6), a
claim that would imply Conjecture 1.1. We close this section with one more
application of the Euler-Lagrange equations.

Corollary 3.8. Suppose that the minimum of $\mathrm{Tr}[S_{m,k}(A, B)]$ over the set of positive
semidefinite matrices is zero, and let $A$ and $B$ be positive semidefinite matrices
achieving this minimum. Then, $S_{m,k}(A, B) = 0$.

Proof. When $k = m$ or $k = 0$, the claim is clear. Therefore, suppose that $m > k > 0$.
Let $A$ and $B$ be positive semidefinite matrices with $\mathrm{Tr}[S_{m,k}(A, B)] = 0$. If either
of $A$ or $B$ is zero then the corollary is trivial. Otherwise, consider

\[ 0 = \frac{\mathrm{Tr}[S_{m,k}(A, B)]}{\|A\|^{m-k} \|B\|^{k}} = \mathrm{Tr}[S_{m,k}(\tilde{A}, \tilde{B})], \]

in which $\tilde{A} = A/\|A\|$ and $\tilde{B} = B/\|B\|$. Combining equations (1.1) with the
assumptions, it follows that $\tilde{A} S_{m-1,k}(\tilde{A}, \tilde{B}) = 0$ and $\tilde{B} S_{m-1,k-1}(\tilde{A}, \tilde{B}) = 0$. Moreover,
equation (2.1) implies that

\[ S_{m,k}(\tilde{A}, \tilde{B}) = \tilde{A} S_{m-1,k}(\tilde{A}, \tilde{B}) + \tilde{B} S_{m-1,k-1}(\tilde{A}, \tilde{B}) = 0. \]

Multiplying both sides of this identity by $\|A\|^{m-k} \|B\|^{k}$ completes the proof. □

4. Proofs of the Main Theorems 

We first use the Euler-Lagrange equations to prove Theorem 1.4.

Proof of Theorem 1.4. Let $m > 1$ and $n$ be positive integers. Since our arguments
are the same in both cases, we consider determining the maximum of $\mathrm{Tr}[S_{m,k}(A, B)]$.
Let $\mathcal{M}$ be the compact set of Hermitian matrices with norm 1 and choose
$(A, B) \in \mathcal{M} \times \mathcal{M}$ that maximizes $\mathrm{Tr}[S_{m,k}(A, B)]$. If $k = 0$, then the desired
inequality is of the form

\[ \mathrm{Tr}[A^m] \le \sum_{i=1}^{n} |\lambda_i|^m \le \sum_{i=1}^{n} \lambda_i^2 = \|A\|^2, \]

in which $\lambda_1, \ldots, \lambda_n$ are the eigenvalues of $A$. A similar argument holds for $m = k$.
Therefore, we assume below that $m > k > 0$.

The Euler-Lagrange equations from Corollary 3.7 imply that

(4.1)   \[ A S_{m-1,k}(A, B) = A^2\, \mathrm{Tr}[A S_{m-1,k}(A, B)]. \]

Performing a uniform, unitary similarity, we may assume that $A$ is diagonal of
the form $A = \mathrm{diag}(\lambda_1, \ldots, \lambda_r, 0, \ldots, 0)$, in which $\lambda_1, \ldots, \lambda_r$ are nonzero. Let
$\tilde{A} = \mathrm{diag}(\lambda_1^{-1}, \ldots, \lambda_r^{-1}, 0, \ldots, 0)$ be the pseudo-inverse of $A$, and set $D = \tilde{A}A$.
Multiplying both sides of (4.1) by $\tilde{A}$, it follows that

\[ D S_{m-1,k}(A, B) = A\, \mathrm{Tr}[A S_{m-1,k}(A, B)]. \]

Taking the norm of both sides of this expression and applying Lemma 2.1, we have

(4.2)   \[ \frac{m-k}{m}\, \mathrm{Tr}[S_{m,k}(A, B)] = \|D S_{m-1,k}(A, B)\| \le \|S_{m-1,k}(A, B)\| \le \binom{m-1}{k}. \]

It follows that $\mathrm{Tr}[S_{m,k}(A, B)] \le \frac{m}{m-k} \binom{m-1}{k} = \binom{m}{k}$ as desired.

We next verify the final assertions in the statement of the theorem. From above,
every inequality in the chain (4.2) is an actual equality. Thus, each term occurring
in

\[ \|S_{m-1,k}(A, B)\| = \sum_{W} \|W(A, B)\|, \]

a sum over length $m - 1$ words $W$ with $k$ $B$'s, takes the value 1. In particular, we
have that $1 = \|A^{m-1-k} B^k\|$. When $m - 1 > k > 1$, an application of Lemma 4.1
below completes the proof of the theorem. The remaining cases $k = 1$ or $m = k + 1$
are dealt with as follows.

Without loss of generality, we may suppose that $k = 1$ (interchange the roles of
the matrices $A$ and $B$). Applying the Cauchy-Schwarz inequality, we obtain the
following chain of inequalities,

(n \ ^ n n 

< J2 i>Hr-'T.^^ < \\A\\\\B\\ = 1. 
i=l ) i=\ i=l 

It follows that each inequality above is actually an equality. In particular, the
second to last identity says that $B$ is diagonal. Moreover, equality in
Cauchy-Schwarz implies that $\lambda_i^{m-1} = \delta b_{ii}$ for some real number $\delta$ and all $i$. Since
$1 = \left| \sum_{i=1}^{n} \lambda_i^{m-1} b_{ii} \right|$, it follows that $A = \pm B$. If in addition, $m > 2$, and $A$ has
more than 1 nonzero eigenvalue, then

n n n 

j=l 1=1 i=l 

a contradiction. Therefore, the conclusions of the theorem hold for $k = 1$, and this
finishes the proof. □

Lemma 4.1. Suppose that $A$ and $B$ are Hermitian matrices of norm 1 and $r > 0$
and $s > 1$ are integers such that $\|A^r B^s\| = 1$. Then, $A = \pm B$ has only 1 nonzero
eigenvalue.

Proof. Performing a uniform, unitary similarity, we may suppose that $B$ is a
diagonal matrix with entries less than or equal to 1 in absolute value. From the
hypotheses, we have that

\[ 1 = \|A^r B^s\| \le \|A^r\| \|B^s\| \le \|A\|^r \|B\|^s = 1. \]

Therefore, $\|B^s\| = 1$, and since $s > 1$, this implies that $B$ has a single
nonzero eigenvalue. It follows that $\|A^r B^s\| = \|A^r B\| = 1$ is equal to the absolute
value of the $(1, 1)$ entry of $A^r$. Finally, since $\|A^r\| = 1$, the matrix $A^r$ has only one
nonzero entry, and therefore, $A$ has only one nonzero eigenvalue. Thus, $A^r = \pm A$
and since $A^r = \pm B$, it follows that $A = \pm B$. This completes the proof. □

The argument for our next result requires the following well-known fact; we 
provide a proof for completeness. 

Lemma 4.2. If $P$ and $Q$ are positive semidefinite matrices, then $PQ$ has all
nonnegative eigenvalues.

Proof. Suppose first that $P$ is positive definite. Then $PQ$ is similar to

\[ P^{-1/2} P Q P^{1/2} = P^{1/2} Q P^{1/2}. \]

In particular, $PQ$ is similar to a positive semidefinite matrix by Lemma 2.5.
Therefore, in this case $PQ$ has all nonnegative eigenvalues. The general version of the
claim now follows from continuity. □
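Lemma 4.2 is easy to illustrate in the $2 \times 2$ case, where the eigenvalues of $PQ$ are the roots of $\lambda^2 - \mathrm{Tr}[PQ]\lambda + \det(PQ)$. The matrices below are hypothetical positive definite examples chosen for illustration; note that their product is not symmetric, yet both eigenvalues come out real and positive.

```python
import math

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][l] * Y[l][j] for l in range(n)) for j in range(n)]
            for i in range(n)]

def eigenvalues_2x2(X):
    """Real roots of the characteristic polynomial of a 2 x 2 matrix.

    Assumes the discriminant is nonnegative, which Lemma 4.2 guarantees
    when X is a product of two positive semidefinite matrices.
    """
    tr = X[0][0] + X[1][1]
    det = X[0][0] * X[1][1] - X[0][1] * X[1][0]
    root = math.sqrt(tr * tr - 4 * det)
    return (tr - root) / 2, (tr + root) / 2

P = [[2, 1], [1, 1]]    # positive definite: 2 > 0 and det = 1 > 0
Q = [[3, -1], [-1, 1]]  # positive definite: 3 > 0 and det = 2 > 0

PQ = matmul(P, Q)
low, high = eigenvalues_2x2(PQ)
print(PQ, low, high)
```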



We are now prepared to present a proof that counterexamples to Conjecture 1.1
are closed upward. Theorem 1.10 closely follows.

Proof of Theorem 1.7. Suppose that Conjecture 1.1 is false for some $m$ and $k$ and
let $A$ and $B$ be real positive semidefinite matrices of unit norm that minimize

\[ \mathrm{Tr}[S_{m,k}(A, B)] = \frac{m}{m-k}\, \mathrm{Tr}[A S_{m-1,k}(A, B)] = \frac{m}{k}\, \mathrm{Tr}[B S_{m-1,k-1}(A, B)]. \]

We show that for these same matrices $A$ and $B$, we have $\mathrm{Tr}[S_{m+1,k}(A, B)] < 0$ and
$\mathrm{Tr}[S_{m+1,k+1}(A, B)] < 0$.

Combining equation (2.1) and using the identities (1.1) from Corollary 3.6, it
follows that

(4.3)   \[ S_{m,k}(A, B) = A^2\, \mathrm{Tr}[A S_{m-1,k}(A, B)] + B^2\, \mathrm{Tr}[B S_{m-1,k-1}(A, B)]. \]

This matrix is negative semidefinite since it is the sum of two such matrices. Hence,
the product $A S_{m,k}(A, B)$ has all non-positive eigenvalues by Lemma 4.2. Thus,
Lemma 2.1 implies that

(4.4)   \[ \mathrm{Tr}[S_{m+1,k}(A, B)] = \frac{m+1}{m+1-k}\, \mathrm{Tr}[A S_{m,k}(A, B)] \le 0. \]

In the case of equality, multiplying equation (4.3) on the left by $A$ and taking the
trace of both sides, it follows that

\[ 0 = \mathrm{Tr}[A^3]\, \mathrm{Tr}[A S_{m-1,k}(A, B)] + \mathrm{Tr}[AB^2]\, \mathrm{Tr}[B S_{m-1,k-1}(A, B)]. \]

However, $\mathrm{Tr}[AB^2] \ge 0$ by Lemma 4.2, and since $A$ is nonzero, we must have
$\mathrm{Tr}[A^3] > 0$. This gives a contradiction to equality in (4.4). It follows that
$\mathrm{Tr}[S_{m+1,k}(A, B)] < 0$ as desired.

In the same manner, we can also prove that

\[ \mathrm{Tr}[S_{m+1,k+1}(A, B)] = \frac{m+1}{k+1}\, \mathrm{Tr}[B S_{m,k}(A, B)] \]

is negative. The conclusions of the theorem now follow immediately. □

Proof of Theorem 1.10. Suppose that $\mathrm{Tr}[S_{m,k}(A, B)]$ can be made negative. The
proof of Theorem 1.7 shows that there exist positive semidefinite matrices $A$ and
$B$ such that $S_{m,k}(A, B)$ is negative semidefinite and $\mathrm{Tr}[S_{m,k}(A, B)] < 0$ (so that
$S_{m,k}(A, B)$ is nonzero). It follows that the second implication in the statement of
the theorem is false. The converse is clear. □

Finally, we work out the proof of Theorem 1.12; the argument is similar in spirit
to the proof of Theorem 1.4.

Proof of Theorem 1.12. Suppose we know that Conjecture 1.1 is true for the power
$m - 1$ and also suppose that for some $k$ there exist $n \times n$ positive definite matrices
$A$ and $B$ such that $\mathrm{Tr}[S_{m,k}(A, B)]$ is negative. Clearly, we must have $m > k > 0$.
By homogeneity, there are positive definite $A$ and $B$ with norm 1 such that
$\mathrm{Tr}[S_{m,k}(A, B)]$ is negative. Let $\mathcal{M}$ be the set of positive semidefinite matrices with
norm 1 and choose $(A, B) \in \mathcal{M} \times \mathcal{M}$ that minimizes $\mathrm{Tr}[S_{m,k}(A, B)]$; our goal is to
show that $A$ and $B$ must both be singular.

Suppose by way of contradiction that $A$ is invertible. The Euler-Lagrange
equations say that

\[ A S_{m-1,k}(A, B) = A^2\, \mathrm{Tr}[A S_{m-1,k}(A, B)]. \]

Multiplying both sides of this equation by $A^{-1}$ and taking the trace, it follows that

\[ \mathrm{Tr}[S_{m-1,k}(A, B)] = \mathrm{Tr}[A]\, \mathrm{Tr}[A S_{m-1,k}(A, B)]. \]

By hypothesis, $\mathrm{Tr}[S_{m-1,k}(A, B)]$ is nonnegative. Therefore, using Lemma 2.1, we
have

\[ \frac{m-k}{m}\, \mathrm{Tr}[S_{m,k}(A, B)] = \mathrm{Tr}[A S_{m-1,k}(A, B)]
   = \frac{\mathrm{Tr}[S_{m-1,k}(A, B)]}{\mathrm{Tr}[A]} \ge 0, \]

a contradiction ($\mathrm{Tr}[A]$ is nonzero since $A$ is nonzero). It follows that $A$ must be
singular. A similar examination with $B$ also shows that it must be singular.

Thus, if Conjecture 1.1 is true for singular $A$ and $B$, it must be true for invertible
$A$ and $B$ as well. This completes the proof of the theorem. □